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Abstract 

Contact tracing is an important control strategy for containing Ebola epidemics. From a 
theoretical perspective, explicitly incorporating contact tracing with disease dynamics presents 
challenges, and population level effects of contact tracing are difficult to determine. In this 
work, we formulate and analyze a mechanistic SEIR type outbreak model which considers the 
key features of contact tracing, and we characterize the impact of contact tracing on the effective 
reproduction number, TZe, of Ebola. In particular, we determine how relevant epidemiological 
properties such as incubation period, infectious period and case reporting, along with varying 
monitoring protocols, affect the efficacy of contact tracing. In the special cases of either perfect 
monitoring of traced cases or perfect reporting of all cases, we derive simple formulae for the 
critical proportion of contacts that need to be traced in order to bring the effective reproduction 
number TZe below one. Also, in either case, we show that TZe can be expressed completely in 

/-I \ 

terms of observable reported ease/traeing quantities, namely Tig = k- - - + where k is the 

q 

number of secondary traced infected contacts per primary untraeed reported case, km is the 
number of secondary traced infected contacts per primary traeed reported case and (1 — g)/g is 
the odds that a reported case is not a traced contact. These formulae quantify contact tracing 
as both an intervention strategy that impacts disease spread and a probe into the current 
epidemic status at the population level. Data from the West Africa Ebola outbreak is utilized 
to form real-time estimates of TZe, and inform our projections of the impact of contact tracing, 
and other control measures, on the epidemic trajectory. 
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1. Introduction 

Gontact tracing has recently gained public attention because of its importance as a control 
strategy in the 2014-2015 Ebola outbreaks. Gontact tracing is a potentially powerful disease 
control strategy in which the close contacts of reported/isolated cases are traced and monitored 
so that if they become symptomatic they can be efficiently isolated and, in turn, cause reduced 
transmissions. There are numerous studies about contact tracing (also known as active case 
Ending) over the past two decades. However, from a theoretical point of view, explicit model¬ 
ing of the contact tracing process has been challenging due to the complexities in reconciling 
forward dynamics of an epidemic with the action of tracers working through a transmission 
chain, along with accounting for particular disease characteristics and public health capabili¬ 
ties. Thus, formulation of mechanistic models of contact tracing that are analytically tractable 
and epidemiologically relevant is an important problem in epidemiology. 
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Because Ebola is typically transmitted through direct contact with bodily flnids of a symp¬ 
tomatic individual and the incubation period is relatively long (average aronnd 11 days), con¬ 
tacts that have been exposed to Ebola virus can be identified, monitored, and, when symp¬ 
tomatic, be isolated to limit spread [Ml El]- Previous outbreaks have been rapidly controlled 
with contact tracing and isolation, along with limiting hospital and fnneral transmission [T6] . 
However, the failnre of initial containment and the snbseqnent nnprecedented scale of the West 
Africa Ebola ontbreak in 2014-2015 have challenged pnblic health anthorities to employ effective 
control measnres. As of Jnne 2015, there have been over 27,000 reported cases in the ontbreak, 
however the incidence has decreased markedly over the past six months due to enhanced con¬ 
trol efforts Therefore it is vital to evalnate how different interventions, such as contact 
tracing, have affected the epidemic and can bring the ontbreak to an end. Direct measurement 
of the impact of contact tracing on an outbreak has recently been considered in relation to 
tnbercnlosis [22], but a general modeling framework for measuring the popnlation-level effect 
of contact tracing with epidemic data is needed for Ebola and other emerging pathogens. 

Previous works about contact tracing have nsed a range of modeling methods from individual 
based models on specific networks to compartmental ordinary differential eqnations at the 
population level. While networks are a natnral setting for contact tracing [25] , their connection 
with traditional epidemic compartmental models (through a mean-field approximation) shonld 
be emphasized. Therefore it is desirable to have a modeling framework which can work in 
both settings with snfficient detail, mechanism and simplicity. Also, branching process models 
[H [321ES] have been utilized for explicit contact tracing structure, but lack both the simplicity 
and compartmental detail inherent in the differential equation model we formulate in this article. 
Many differential equation models have incorporated contact tracing implicitly [Ml El], though 
Hyman et al. [23] considered a deterministic model with explicit contact tracing for HIV, a 
different setting than an emerging ontbreak snch as Ebola. Mathematical modeling of Ebola 
has been considered by many anthors, e.g. [21 ESI EH El Uni Ell El EHl ESI EH EH ESI ESI EH > 
several of Ebola modeling stndies have implicitly incorporated effects of contact tracing (along 
with other interventions) [Ml IMl |31 ESI SI] • However implicit inclusion of contact tracing fails 
to captnre the true effect of tracing on the effective reprodnction nnmber Tie, a key qnantity of 
interest for epidemiologists. 

In this work, we develop a deterministic ontbreak model of contact tracing particnlarly appli¬ 
cable to Ebola epidemics, which explicitly links tracing back to transmissions, and incorporates 
disease traits and control together with monitoring protocols. We calculate the effective repro¬ 
duction number. Tie, and simulate the model under different control scenarios. In particular, 
we perform sensitivity analysis to determine how disease characteristics, variable monitoring 
protocols, case reporting and timing of intervention affect the efficacy of contact tracing in 
controlling the epidemic. In the special case where either traced contacts are always prevented 
from causing fnrther infections (perfect monitoring) or 100% of cases are reported (perfect 
reporting), we derive simple formulae for the critical proportion of contacts that need to be 
traced in order to bring Tie below one. 

Furthermore, from the explicit linking of tracing and transmission in our model, we derive novel 
formulae directly relating the effective reproduction number Tie to contact tracing observables. 
Indeed, define k as the average nnmber of secondary infected contacts traced per primary 
reported untraced case, and q as the fraction of reported cases which are traced contacts. 
Then, in the case of perfect monitoring of traced contacts and the general setting of unknown 
nnderreporting with distinct reported/nnreported case infectious periods, we show that the 
following formula holds: 
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Thus, TZe is simply the product of the average number of secondary infected traced contacts 
per untraced reported case and the odds that a reported case is not a traced contact. If contact 
monitoring is imperfect, but case reporting is assumed to be perfect (100% of cases reported), 
then TZg can also be expressed simply in terms of contact tracing observables. In this instance, 
define as the average number of secondary infected contacts traced per primary reported 
traced case, then: 


TZf> — k ^^^ + fcm 

Utilizing current data and this formula, we form weekly estimates of TZ^ for the Ebola outbreaks 
in Sierra Leone and Guinea, with the main goal of determining the impact of contact tracing 
on the epidemics. Taken together, our model and results quantify contact tracing as both a 
dynamic intervention strategy impacting disease spread and a probe into the current epidemic 
status at the population level. 


2. Model and Methods 

2.1. Base SEIR model 

We begin with an SEIR-type base model consisting of compartments representing susceptible 
(S), exposed or incubating (E), and two distinct infectious groups, namely infectious indi¬ 
viduals which will be hospitalized/reported {Ih) and infectious individuals which will not be 
hospitalized and unreported (lu), along with the decoupled compartment of cumulative hospi- 
talized/reported cases {H). The following differential equation system models the dynamics of 
these populations: 


S'{t) = -^S{t) (hit) + hit)) 

E'it)=(3Sit) ihit) + Iuit))--Eit) 

T 

m = ptE{t) - ( 1 ) 

^ J-h 

/;(() = (i-p)Tw-W(f) 

^ -l-u 

H'{t) = 

J-h 

The parameter (3 represents transmission rate for infectious individuals, r is mean incubation 
period, p is fraction of hospitalized cases, Th is time from infectiousness (symptom) onset until 
hospitalization/isolation and reporting, and is the mean infectious period for an unhospi- 
talized case. Note that we consider an infectious case being hospitalized, isolated and reported 
as the same, whether or not the individual is treated in a hospital or placed in an isolation 
unit. We assume that such a case is reported at the time of hospitalization and neglect any 
possibility of hospital transmission. While transmission in a health-care setting poses signifi¬ 
cant risk to workers and visitors during Ebola outbreak, community transmission accounts for 
the vast majority of cases. In addition, the main focus of our article is a detailed model of 
contact tracing, so we elect to make the model simpler by not including hospital transmission. 
For this reason, we also neglect another route of transmission particular to Ebola; post-death 
transmission due to improperly handled deceased [13], and leave incorporation of these features 
into contact tracing models for future work. 
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(a) 


(b) 


Figure 1: Base Models: (a) Single infectious compartment model (equation (|B1[) in Appendix C) with 


infectious period T and proportion hospitalized p. (b) Multiple infectious compartments model with variables 
for those which will be hospitalized, and unhospitalized, corresponding infectious periods Th and T^, 
and proportion hospitalized p (equation 0). Informally, in (a), the “hospitalization fate” is determined upon 
exit from infectious class, whereas in (b), the “hospitalization fate” is determined upon entry to infectious class. 
Note that the model in (a) is a special case of the model in (b). 


The distinction between reported cases and unreported cases is important when consider¬ 
ing contact tracing since only reported cases can trigger contact tracing. We explicitly sep¬ 
arate infectious individuals into distinct compartments based on whether they will be re¬ 
ported/hospitalized or unreported. Note that the distinct infectious compartments of system 
([^ are in a similar spirit to the model in [29], where the authors divide the infectious class 
into individuals who will recover and those who will die. An alternative base model, the classic 
SEIR model shown in Figure 1(a), has a single infectious compartment I with mean infectious 


period T and, again, fraction of hospitalized cases p. It is proved in the Appendix C that such 
a model is actually a special case of ([^; namely the case where Th = = T and the initial 

conditions //i(0) = p/(0),/„(0) = (1 — p)J(O). Thus, the multi-infectious compartment model 
([^ allows for more generality than the single infectious compartment counterpart, with an ad¬ 
ditional independent parameter. Also, it may be unreasonable to assume that infectious time 
periods and are equal. Therefore, we utilize the more general base model ([^ with multi¬ 
ple infectious compartments, but note that our contact tracing formulation can be incorporated 
into either base model. 

For model ([^, we can consider the time until hospitalization, Th, and the fraction of (untraced) 
hospitalized cases, p, as parameters which can be together targeted by public health measures 
(independent of active contact tracing). The infectious period of unhospitalized cases, T^, can 
be considered a property of the disease. In particular, we can take + (1 ~ '^)Tr 

where 'ip is mortality rate (outside of hospital), and Td and are the times until death and 
recovery, respectively. Note that instead of explicitly separating infectious cases that will be 
unreported into compartments representing death and recovery, we use a single unreported 
infectious compartment for simplicity and since Td ~ (see Appendix C for justihcation). 
The effective reproduction number, TZ^, for model ([^ is given by 


7^e = pSo {pTu + (1 - p)Tu) = 7^o ( + 1 - P 


( 2 ) 


where Sq is the initial susceptible population size and TZq = /3SoTu is dehned as the basic 
reproduction number (without hospitalization of cases). Standard incidence [T5| is often utilized 
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for Ebola models instead of the mass-action transmission assumed here, in which case IZq is 
independent of Sq. We remark that replacing the mass-action (per-capita) contact rate (33 with 
the standard incidence rate (3^ does not affect the results in this article. Also, the population 
size N remains approximately constant over the relevant timescale of the outbreak, thus we 
can think of ~ S'o as being absorbed into the parameter (3. 

2.2. Incorporating contact tracing into dynamic models of EVD 

We incorporate contact tracing into the SEIR model framework by deriving a mechanistic link 
between the retrospective action of contact tracers with forward disease dynamics. First, we 
describe the recommended contact tracing process for Ebola. According to the WHO guidelines 
for contact tracing in Ebola outbreaks [20], contact tracing consists of three basic elements, 
namely, contact identihcation, contact listing and contact follow-up. The process of contact 
identihcation begins upon an infected individual (alive or dead) being reported and isolated. A 
public health officer then conducts an epidemiological investigation to determine the contacts 
of the reported case. In the next step (contact listing), a team of contact tracers tracks the 
contacts identihed from the hrst step. If they are showing symptoms at the time of tracking, 
then the tracers will call for an ambulance team to remove and isolate the individual. Otherwise, 
the tracked contacts are informed of their contact status and told to report any symptoms 
immediately to a provided telephone number, along with restricting contact with other people. 
A surveillance team follows up on the tracked contacts by visiting them regularly during a 21- 
day (“approximately” maximum incubation period) window, and calling the ambulance team 
if symptoms are present. 

In order to model this process, we hrst make the following assumptions on top of the base 
model: 

• The hospitalized/reported cases can trigger contact tracing. 

• Contacts of a reported case are traced with probability cj) and there is a hxed delay 6 
between index case reporting and contact tracking. If a traced contact is infectious at 
this time, they are immediately removed/isolated; otherwise they follow a monitoring 
protocol aimed at early detection and isolation if symptoms develop. 

Note that the probability 1 — 0 accounts for untraced contacts in terms of, both, reported cases 
that do not trigger tracing and “missed” contacts from reported cases that trigger contact 
tracing, i.e. 0 = /i^ where /i is fraction of reported cases that trigger contact tracing and f is 
the average fraction of total contacts from a reported triggering case that are traced. Another 
way of expressing 0 is the following: 

^ total if of traced contacts of reported cases 

total a of contacts of reported cases 

We now formulate the contact tracing terms for incorporation into the base model Let 
Ce(t), {Ci{t)) be dehned as the cumulative number at time t of traced infected individuals who 
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are in exposed (infectious) class when first tracked. Then 


OO 


c'At + S) = 



rate of reporting 
of infectious individuals 


/ 



reported case 
infectious period 
probability density 


t 



integral over 
infectious period of 
length s 


FEit — r + S) (f)(3S{r)dr 

probability still traced transmissions 
incubating t — r 6 . infected at 

units after infection time rG[£—5,t] 


ds, 


OO t 

C'S + i) = f y (1 - FeU - r + 6)WS{r) drds, 

0 t—s 


(3) 


where the incubation period cumulative distribution Fe can be more general than the assumed 
exponential distribution. Since the average infectious period is relatively short, we can assume 
S is approximately constant over the infectious period, i.e. ^ S{r)drds ~ ThS{t). 

Then the total number of traced transmissions at time t + 6, C{t + 6), satishes 

C\t + 6) = C'^it + 5) + C[{t + 5)^ ct)l3S{t)h{t) 


Motivated by these calculations, we dehne the following additional four compartments: i?®, (T^®) 
are exposed individuals who will be traced and incubating (infectious) when tracked; are 
infectious individuals who have been traced during incubation stage and put under monitoring 
protocol; J® are infectious individuals who will be traced/removed (since symptomatic when 
hrst tracked). The modeling framework for incorporating tracing is to separate the fraction (j) 
of reported case contacts that are traced at the moment of transmission, thus establishing an 
explicit link between tracing and reported case transmission. Since the action of tracers upon 
tracking a contact depends upon whether the traced contact is displaying symptoms, we further 
separate transmissions which will be traced into those that will be incubating upon tracking 
{E^) and those who will be infectious upon tracking (i?®). The probability Pe of a contact traced 
individual being incubating when tracked should generally depend on the index case infectious 
period Th, the delay between index case reporting and tracing 6, and the incubation period of 
the traced case r. Indeed Pe can be explicitly calculated from ([^ as: 

„ =_ 9M _= ( 4 ) 

when Fe{x) = (exponentially distributed incubation period) and assuming that S{t) is 

constant over an individual case’s infectious period. For the fraction (1—Pe) of traced individuals 
who are infectious when tracked, their average infectious period (prior to being tracked), Ti, can 
be approximated by considering the expected value of the time after symptom onset when a 
traced contact is tracked (given that they have progressed past their incubation period upon 
tracking), T): 


111 , ^1 
, where F = — 
Ti T Ti n 


Th+S 


Th + 6- 


xe 


—{x—r)lT 


f 

Jo 


ds 


dx 


dr (5) 


with T = pTh + (1 — p)Tu is the (weighted) average infectious period independent of contact 
tracing. The above formula ([^ is explained in more detail in the Appendix A We introduce 
the following additional assumptions and parameters: 
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Figure 2: Mechanistic model of contact tracing: Compartments dividing transmissions into those that are 
untraced, E (whom will be unreported or reported Ih and can trigger tracing), and those that are traced 
(whom will be tracked when incubating or infectious Ef). The tracing process is depicted with the relevant 
sequence of events for the traced individuals. Note that higher order tracing events are not illustrated in the 
diagram, but are included in the model Q. 


• The infectious monitored individuals, J®, have infectivity fim and mean infectious period 
(time from symptom onset until hospitalization) T^. The parameters and ffm depend 
on the strictness and efficacy of the monitoring protocols. Note that the infectious indi¬ 
viduals traced during their infectious period, /*, have infectivity {3 prior to tracking and 
mean infectious period Tp 


• Contact traced infectious individuals, and /*, trigger contact tracing after being iso¬ 
lated/reported or tracked, which is termed higher order tracing. As with first-order trac¬ 
ing, the fraction 0 of their contacts will be traced. In particular, we assume that all 
II individuals will be tracked, reported and trigger contact tracing, even if they die or 
recover before being traced. We introduce the parameters p® (p*) representing the prob¬ 
ability a higher order traced contact will be tracked during their incubation period after 
being infected by an If (If) individual. 

• The probabilities p® and p* can be calculated similar to Pe (|^, except using index case 
infectious periods Tm and T) respectively. Note that the infectious period for a higher 
order tracing infection caused by a If individual should be smaller than Tj. But this would 
require additional compartments, so we choose to approximate the infectious period as 
Tj and “close the system” here. 


With all of these assumptions, we arrive at the following model: 

= -l3S{h + F)-l3Sli-l3^SIl 
E' = flSh + (1 - 0) tdSh + (1 - (t>)PSli + (1 - 0)0™^/: - -E 

T 

{EtY = p,4>0SI„ + piY>PSIi + pt4>l3,nSI‘-tE; 

{EiY = {i-p,)4,DSh + (i-A)4'iirs+(i-pi)4>ii.nsn-tEi 

h = ( 6 ) 

T Ih 
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The model provides a detailed dynamic description of contact tracing, reconciling the forward 
dynamics of the epidemic with the action of tracers working backward along a chain of contact. 
The variable and parameter descriptions are listed in Table [Tj Despite the nnmber of compart¬ 
ments, several of the parameters are dependent on basic independent parameters. Also, efficacy 
of contact tracing is determined by both disease characteristics (throngh probability Pe and Ti as 
fnnction of incnbation period r and infectious periods T^, T„) and monitoring/response efficacy 
(through delay 6 in calculation of Pe, and parameters (3m and T^). 

There are limitations and corresponding generalizations of the model for future work. For 
example, it is assumed that the incubation and infectious period are exponentially distributed, 
and infectivity (transmission) is uniformly distributed throughout the infectious period. These 
are inherent assumptions of the ordinary differential equation system, however they may over¬ 
simplify the true epidemiological distributions. Most likely these simplihcations underestimate 
the impact of contact tracing. Indeed, an exponentially distributed incubation period puts too 
much weight in small incubation times, which decreases the value of pe- A more appropriate 
distribution, for instance the gamma or Weibull distribution fit to Ebola patient incubation pe¬ 
riod data [SSIE], will result in higher vales of Pe, which increases the traced contacts that are 
monitored. In addition, assuming a uniform infectivity distribution, i.e. constant transmission 
rate, may not be consistent with the fact that viral load will steadily increase after symptom 
onset and peak infectivity will occur some time after the infectious period begins. Thus, early 
removal of infectious individuals may drastically reduce transmission potential when these in¬ 
dividuals are removed before peak infectivity |12]. Fraser et ah have considered contact tracing 
in an infection-age model with general infectivity distribution, however they did not explicitly 
link tracing and transmission 12 ]. Infection-age structured or stage structured models can 
include the aforementioned disease features, along with post-death transmission, and incor¬ 
porating tracing into these types of models is left for future work. Despite the limitations, 
our ODE formulation offers an analytically tractable, mechanistic model of contact tracing in 
disease outbreaks. 


3. Results 

3.1. Formulas for TZ^: control and probing properties of tracing 

In this section, we consider special cases of the general model (|^ in which explicit formulas 
for TZe are simple and can be expressed completely in terms of contact tracing observables. 
First, we consider the special case that Pe ~ 1 and (3m = 0, i.e. long incubation period relative 
to infectious period, fast tracking and perfect monitoring. Although the formula (|^ always 
yields Pe < 3 for an exponentially distributed infectious period and incubation period, it is 
reasonable to assume that p^ = 1 (traced individuals are always tracked during their incubation 
period) when the incubation period is large relative to infectious period. In other words, we 
are assuming that contact tracing is “perfect” in the sense that traced individuals do not cause 
any secondary infections (because of effective monitoring and tracking). Later in this section, 
we will relax this assumption and consider imperfect monitoring (3m > 0 in the case of perfect 
reporting (p = 1). 

The effective reproduction number, in the case of “perfect” tracing {(3m = 0, Pe = 1) 
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Table 1: Model variables and parameters. 

Variable/Parameter 

Description 


Variables 


S{t) 

m 

Susceptible individuals 

Exposed (incubating) infected individuals 

i{t) 

Infectious individuals 

Em 

Exposed individuals who will be traced and incubating when 
tracked 

Em 

Exposed individuals who will be traced and infectious when 
tracked 

m 

Monitored infectious individuals (previously contact traced 
when incubating/not symptomatic). 

im 

Infectious individuals who will be traced and removed upon 


tracking (since symptomatic when first tracked). 

Independent parameters 


/S 

Transmission rate (for unmonitored infectious) 

T 

Mean incubation period 

Th 

Time until hospitalization/reporting independent of contact 
tracing 

Tu 

Mean infectious period for unhospitalized case 

P 

Fraction of untraced cases that are isolated/reported (and can 
trigger tracing) 

0 

Probability that contact of reported case is traced 

(5 

Time delay between case reporting and contact tracing 

f^TTL 

Transmission rate for monitored infectious (//) individuals 

T 

^ m 

Mean infectious period for monitored infectious individuals 
(time between symptom onset and isolation) 


Dependent parameters 


Pe 

T. 

Probability of (hrst order) traced contact being incubating 
upon tracking (calculated by formula (j^, depends on r, T, 5) 

Mean infectious period (prior to being tracked) for contact 
traced infectious (/*) individuals (calculated by formula 
depends on r, T, 5) 

Pi 

Probability of (higher order) traced contact (infected by // 
individual) being incubating upon tracking (calculated by for¬ 
mula (4|) using r, T^, 5) 

Pi 

Probability of (higher order) traced contact (infected by /* 

individual) being incubating upon tracking (calculated by for¬ 
mula (|4|) using r, T*, 5) 


simplifies to the following (calculation in Appendix B): 

7^e = 7^o - 0) + 1 - , 

where TZq := (5SqTu is the basic reproduction number, or more precisely the reproductive 
number in the absence of contact tracing and hospitalization, p is the fraction of (untraced) 
cases which are reported and 0 is the fraction of reported case transmissions (contacts) which are 
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traced. Notice that in the special case of one mean infectious period for all cases, = T, 

the formula for TZf, reduces to the following: 

Tie = Tlo{l - p4)) 

Thus, in this case, the contact tracing effort needed to reduce the effective reproduction number. 
Tie) below one is: 

= 1 ^ (P0)c = 

/Vo 

where is the critical proportion of total cases which need to be traced in order for He = 1- 
This is the product of the fraction of cases which are reported, p, and the fraction of contacts 
(more precisely, transmissions) of reported cases which are traced, 0. This formula is similar 
to what was obtained in special cases of two branching process contact tracing models |32l [26] , 
but our approach differs from these previous works and is much simpler to derive. Also, note 
the similarity with the well-known simple herd immunity vaccination proportion threshold 

Pc = l-l/7^o m 

The goal now is to directly connect He with parameters that can be inferred by case and contact 
tracing data. Considering the link between contact tracing and transmission, we rewrite the 
fraction 0 of transmissions from a reported case which will be traced: 

^ # of traced contacts per reported case i 

total 70 of contacts per reported case n’ 

where these quantities are defined in a wholly susceptible population, i.e. the total population 
size is = Sq. Now the transmission rate (3 can be written as 0 = pc, where p is the probability 
of transmission per contact and cSq is the contact rate. Now for a (untraced) reported case 
n = cSoTh = f3SoTh/p. Define the parameter k as the average number of secondary infected 
traced contacts identified per (untraced) reported case. Then k := pi = (3SoTh/(f>. Note that 
the value of the parameter k can be estimated from data and records. 

Another important parameter that can be obtained from case and tracing data is the fraction 
of reported cases which are traced contacts, denoted by q. We find that q can be expressed as 
the following constant in terms of model parameters: 


_ <P _ 

0 + (1 - 0)p (1 - p)Tu/Th' 


(7) 


Indeed, one can interpret the denominator (multiplied by p) as the probability a case is reported 
considering the different potential routes to reporting: traced; untraced, infection caused by 
reported case; untraced, infection caused by unreported case. Also, notice in the case of 100% 
reporting {p = 1), q reduces to g = 0. In addition, from a dynamic perspective, we define the 
fraction from the cumulative reported cases over some time window [f — a, t] which are traced 
contacts: 


Qait) := 


^ lia Us) ds 


Tm ft-a ^c(s) ds + j, dh{s) ds 


In the Appendix B, it is proved that qa{t) ~ q for sufficiently large t, small a; 


convergence is quite rapid as illustrated in Figure 3(b) for the case a = 7 days. 

Combining the formula ([^ and k = (dSoT^cp, we find the following formula for the 
reproduction number He'. 


and the 
effective 


He = k 



( 8 ) 
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Figure 3: (a) Contour map of TZe with respect to k and q from formula ([^. (b) The weekly cumulative fraction 

of reported cases that are traced contacts, g(t), converges to approximately (or to exactly) the quantity q given 
in formula Q. 


Thus, TZe can be formulated simply as the product of the number of infected traced contacts 
per (untraced) reported case and the odds that a reported case is not a traced contact in the 
case of perfect tracing. Notice that this formula is composed of two quantities that can be 
determined by case data and contact tracing records. 

The intuition behind the formula for TZe is clear in the case where 100% of the cases are 
reported. Indeed, consider a reported case which has caused k + m secondary infections, where 
k of these are traced (consistent with notation) and m are untraced. Note that this reported case 
is necessarily untraced since we are assuming that traced individuals are perfectly prevented 
from causing further infection. If this reported case represents an average infectious individual 
in the population and the level of contact tracing here represents the average in the population, 
then the effective reproduction number TZe should be m since this is the number secondary 
infections which can cause further transmissions. Utilizing the formula (|^ applied to this 
transmission chain, we correctly hnd 


TZe = k 



k 


m/{k + m)\ 
k/{k + m) ) 


= m 


(9) 


The somewhat surprising result is that formula (|^ still holds with an arbitrary proportion of 
cases reported and distinct infectious periods for reported and unreported cases. Thus contact 
tracing observables can theoretically provide an exact measure of TZe even in the presence of 
underreporting (p < 1) and distinct infectious periods (T^ < T„). Our simple explanation 
utilizing the single index case and single level of transmission chain above will not suffice in the 
case of p < 1 since we can add unreported cases which would not appear in the calculation (|^, 
but should appear in TZe- However, a simple intuitive line of reasoning is that underreporting 
will cause cases to not be counted in individual transmission chains, but will also add to the total 
reported cases since additional reported cases that are transmissions of unreported cases will 
appear, thus balancing out the effect of underreporting. A more complete heuristic argument of 
this result with transmission chains probably requires multiple levels and multiple index cases 
of an outbreak transmission tree. Of course, considering explicit transmission trees more in 
depth leads to thinking about contact tracing on networks, which is a subject in itself. We 
leave this as ongoing and future research, and provide further discussion about extending our 
work to networks in Section 01 
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In order to quantify the impact of control measures on reduction in reproduction number, 
we can calculate the relative change in reproduction number after applying the intervention. 
Dehne TZ° as the effective reproduction number when there is only case reporting/hospitalization 
independent of contact tracing (e.g. passive case hnding HU) and no contact tracing. Thus 

71° = Tlo + 1 — pj. The relative changes in reproduction number with respect to the 
distinct interventions, passive case hnding and contact tracing, are as follows: 


relative change in IZe with tracing 
relative change in TZq with only passive case hnding 

relative change in TZq with passive case hnding & tracing 


Tll-TZ, 

Tip-Til 

TZq 


TZp — Tie 
Tip 


PQ 

l-g(l-p)’ 

( 10 ) 

l-g(l-p) 


With these formulas, we can quantify the impact of contact tracing and compare with the 
strategy of only passive case hnding. Notice that the particular role of contact tracing in re¬ 
ducing the ehective reproduction number increases with q. Thus q provides a simple observable 
measure of the impact of tracing, however the relative change in Tie also depends on fraction 
of cases reported independent of tracing, p. 

Next, we consider another special case, in particular the assumption of perfect tracing is 
relaxed. Suppose that there is perfect reporting and tracking, but imperfect monitoring, i.e. 
p = Pe = 1, /3m > 0. In this case of 100% reporting, the reproduction number in the absence 
of contact tracing is simply (3Th, which we denote by Tl := (TSpTh. In addition, dehne the 
reproduction number of contact traced (monitored) individuals as 71^ = PmSpTm- Then 9 = 
Tlm/Tl represents the reduction in secondary cases of a traced individual compared to an 
untraced individual. As in the previous case. Tie reduces to a simple formula: 


Tie — (1 ~ 0 ) 3 ^ + (f^Tlm 


Thus, in this instance, the critical proportion of total cases which need to be traced in order to 
bring Tie below one is: 


Tie = 1 ^ 


( 1 - 0 ) 1 



In order to describe Tie in terms of contact tracing observables for this case, dehne k = (f)Tl and 
km = (fTTlm to be the average number of traced infected secondary cases caused per primary 
reported untraced infected and traced infected, respectively. In the case of 100% reporting 
(and Pe = 1), P = 1, it is not hard to show that the fraction of reported cases which are traced 
contacts, q, is simply g = 0. Thus, we hnd that 

Tie = k ^^ -|- km (12) 


The impact of tracing in this case can simply be quantihed, since the ehective reproduction 
number without tracing is Tl^ = Tl. The relative change in Tie with tracing is: 


Tl° 


= q{l-9) 


(13) 


Together formulas (12) and (13) quantify both Tie and the particular impact of contact tracing 
on Tie for a given tracing/monitoring efficacy (in terms of secondary case reduction 6 due to 
tracing) in the case of 100% reporting. 
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3.2. Weekly point estimates of Tie for Sierra Leone and Guinea 

The results in the previous section suggest that the effective reproduction number, Tie, can 
be estimated directly from contact tracing and reported case data via simple formulas derived 
from special cases of our mechanistic model of contact tracing. In this section, we will apply 
formulas (I^ and (12) to recent contact tracing and reported case data from the Ebola outbreak 


in Sierra Leone and Guinea in order to obtain some weekly point estimates of Tie for these two 
countries under the assumption of the special cases discussed in the previous section, and in 
particular different tracing efficacies. Then, we calculate the effective reproduction number 
without contact tracing, 71°, in order to gauge the potential impact of contact tracing on the 
epidemic. First, we briefly compare some other methods of estimating the effective reproduction 
number in outbreaks from the literature. 

A common method for estimating the reproduction number is to fit a deterministic model 
to incidence data in order to infer unknown parameters in the underlying model and then 
calculate the reproduction number from standard techniques (e.g. next-generation method) 
[311 El [131 ESI E3]- Drawbacks to this approach are the lack of generality, context-specific 
assumptions, potential identifiability issues along with the fact that parameters vary throughout 
the epidemic uma]. More recent statistical approaches based on the work of Wallinga and 
Tennis utilize case incidence data and knowledge of the distribution of the serial interval to 
estimate time-varying values of Tig throughout an epidemic HllISl. These statistical methods 
are more generic in the sense that the underlying model is an infection network, however their 
implementation can be complex and requires prior estimates of serial interval distribution. 

We propose a novel method for estimating up to date values of Tig based on contact tracing 
and reported case observables by utilizing formulas (1^ or (12). In particular, with reported 


case data and contact tracing records detailing, both, the traced infected individuals and their 
infector, we can directly utilize these formulas for calculation of Tig (which hold under the 
different simplifying assumptions of the previous section). Indeed, suppose we have the reported 
case count at a particular time point t (cumulative reported cases during time window t — a 
to t), information about whether these cases are known traced contacts, along with records 
discerning the number of traced infected contacts which are derived from these reported cases. 
Then the number of traced infected identified per untraced and traced reported case, k and 
km, can be calculated at this time t. Furthermore, the fraction of reported cases which are 
traced contacts, q, can be calculated for the subsequent time windows of reporting for these 
traced contacts derived from the infecting reported cases at time t. Then, the formula (12), 
Tig = k{l — q)/q + km, can be utilized with an appropriate averaging technique to estimate 
the effective reproduction number. Tig, at time t. Thus, with a time-series of this case/tracing 
data, the corresponding point estimates of Tig can be calculated in this way for each feasible 
time point in the series. 

In comparison with the other methods, our proposed procedure is similar to the statistical 
approach first described by Wallinga and Tennis BDl. however, our approach shows a potential 
way to, both, incorporate specific contact tracing data and remove the need for utilizing (and 
estimating) the serial interval distribution. We remark that Cauchemez et ah utilize contact 
tracing data in order to infer the serial interval and use this to “reconstruct tracing information 
for untraced cases” assuming perfect reporting [7j. Our work suggests that contact tracing 
and reported case data, by themselves, may be sufficient to estimate Tig based on formula 
(12). Also, since our formulas are derived from a mechanistic differential equation model, 
we provide a bridge between compartmental deterministic models and the statistical methods 
based on reconstructing an infection network. With our general detailed model of contact 
tracing in the background, we can explore the effect of relaxing simplifying assumptions about 
the tracing process. In addition, the formulas (10) and (13) provide a fresh perspective on 
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Figure 4: WHO data of weekly reported and traced cases for Guinea (a) and Sierra Leone (b) for mid January 
to mid April. The fraction of reported cases that are known traced contacts, q, is plotted versus week for Guinea 
(c) and Sierra Leone (d). The data was obtained from weekly WHO situation reports ([I]). 


real-time evaluation of efficacy of control measures, in particular contact tracing, which was a 
problem considered in j7j. 

Recall that the formula was derived to hold with arbitrary underreporting (p < 1) in the case 
of perfect monitoring, = 0, and hold for 100% reporting (p = 1) in the case of imperfect 
monitoring, > 0. It should be possible to obtain the k and km directly from contact tracing 
data, however, we were not able to hnd such data, and we recommend public health officials 
to keep and publicly share the necessary records for the estimation of average traced infected 
contacts found per reported case. Despite this limitation of direct data, in a similar spirit to 
the methods of Wallinga and Tennis [ID] , we utilize estimates of the serial interval distribution 
for Ebola from previous works HU, in order to infer likely “averaged” infection networks, from 
which values of k and km are obtained under different scenarios of tracing efficacy. In this way, 
we calculate weekly point estimates of TZe utilizing formula (12) with weekly case and tracing 
data, along with incorporating previous estimates of the serial interval distribution of Ebola, 
in order to evaluate the potential impact of contact tracing on the epidemic. 

The WHO [T] has recently published weekly counts of traced individuals among reported 
cases in Guinea and Sierra Leone as part of the weekly WHO situation reports (see Figure]^. 
Recall, by (12), the effective reproduction number can be estimated a.s TZ^ = k ( + km, 
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where k and are the infected traced contacts identihed per nntraced and traced reported 
case, respectively. Define Cj and Tj as the cnmulative reported cases and traced (reported) 
cases in week j, respectively. The serial interval, which we denote by a, is defined as the time 
from illness onset in the primary case to illness onset in the secondary case [H]. The serial 
interval of Ebola has been estimated to be a Weibnll distribntion (random variable denoted 
W) with scale and shape parameters estimated at 13.6 and 2.6, respectively [H]. Therefore, 
we estimate the probabilities pi, p 2 , and ps of the serial interval being < 7 days, 7—14 
days, and 14 — 21 days, as: pi = P(1T < 7) = 0.18, p 2 = P(7 < hh < 14) = 0.51, and 
P 3 = 1 — P 2 — Pi = 0.31 ~ P(14 <W < 21). Snppose that the tracing/monitoring reproduction 
number reduction is 0 < 6^ < 1, i.e. IZm = OIZ. Then, the average number of traced infected 
identified in week j + n per untraced and traced reported case from week j, k^’"^ and k^, and 
the fraction of reported cases that are traced in week j + n, qj+n, are: 


k3,n ^ 


T. 


j+n 


Q-(1-0)T/ 


_ 


9T, 


j+n 


T, 


Q-(1-0)T/ 


Qj+n 


j+n 


a 


j+n 


since k^’'^{Cj — Tj) + k^^Tj = Tj+n and k^^ = 

Thus, we estimate the (expectation of) effective reproduction number of week j, E,{TZe'^] 


as: 


e(kP) = P(n — j — — j weeks) ■ \ n — j — l<a<n — j weeks) 


n 





Therefore is estimated as the ratio of the sum of untraced and a fraction 9 of traced 
reported cases, weighted (based on serial interval distribution) over the weeks following j, to 
the sum of untraced and this fraction 9 of traced reported cases during week j. In the case 
of perfect tracing, 6* = 0, only weekly untraced reported cases {Cn — T„) are involved in the 
calculation of TZe- Conversely, in the case that tracing has no impact, 9 = 1, all reported cases 
(traced or untraced) are equally weighted in the calculation of Tie, thus the specific tracing 
data giving traced reported cases per week (T„) is not utilized. We note that a more rigorous 
maximum likelihood analysis can be developed with our approach similar to |10] and confidence 
intervals can be generated for the weekly estimates of Tie- However, since we are more interested 
in a relatively straightforward application of formula (12) to data and quantifying the potential 
impact of contact tracing, we simply calculate the expected value of the weekly reproductive 
number, TZP, by the law of total expectation. 

In Figure 5(a) and 5(b), the weekly point estimates of Tie for Guinea and Sierra Leone are 
obtained using the formula (14) for different levels of tracing efficacy; reproduction number 
proportion 9 = 0 (perfect tracing), 6' = 1/2,6' = 1 (no impact of tracing). Figure 5(a)| and 


5(b) I shows the estimates of Tie in the case 0 = 1/2 alongside the weekly reported cases for 
Guinea and Sierra Leone, respectively. Notice that, especially for Guinea, the reproduction 
number Tie fluctuates substantially about the epidemic threshold Tie = 1 over the time period 
depicted. These fluctuations coincide with changes in the weekly incidence curve with a delay 
representative of the serial interval. For the Figures 5(a) and 5(b), one should keep in mind 


that formula (14) is utilized to calculate Tie, thus reported cases and the amount of traced 
cases affect the values of Tie- In comparison with Guinea, the estimates of Tie for Sierra Leone 
have a more decreasing trend with smaller values and less fluctuation, and this pattern also 
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Figure 5: Weekly estimated values of TZe ( in (14) ) for both Guinea (a) and Sierra Leone (b) assuming 

tracing reproduction number proportion 9 = 0, i.e. perfect tracing (blue), 9 = 0.5 (purple), and 9 = 1 (orange). 
In (c) and (d), estimates of TZe assuming 9 = 0.5 are shown alongside the weekly case count curve for Guinea 
and Sierra Leone, respectively. 


holds true in the reported case count. In addition, Figures 4(c), 4(d) show that the fraction 
of traced cases, q, is larger for Sierra Leone. This suggests that contact tracing has been more 
successful in Sierra Leone over the time period, as will be discussed below and further depicted 
in Figure Supporting this notion, delving deeper into the given data in Figures 4(c) and 


we notice that the auto-correlation of the time-series of q for Guinea is more random 


than that of Sierra Leone. Indeed, for Sierra Leone the data for q has larger auto-correlation 
corresponding to time lags less than three weeks, which is expected based on the serial interval 
of Ebola. This suggests that perhaps Sierra Leone has had a more constant tracing effort when 
compared to Guinea, which may explain the different outcomes of the countries over this time 
period. Further comparison of the countries’ responses and epidemics is an important topic 
and will be studied in future work. 

The reproduction number fluctuates more when assuming perfect tracing (0 = 0), as only the 
untraced cases are assumed to cause secondary infections and the fraction of reported cases 
that are traced contacts, q, is not hxed from week to week. Gertainly the limited amount 
of tracing data, combined with variability in q, may cause error in our calculation of TZe- 
In addition, recall the simplifying assumptions of either perfect tracing or perfect reporting 
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under which formula (12) holds. Also, it is unclear how much of the variation in is due to 


observation error or underlying stochasticity in the transmission process versus actual changes in 


reproduction number (transmission potential). The success of formula (12) in hnding the “true 


expected value of the (possibly time-varying) effective reproduction number IZe will be explored 
in future work with simulated data. Also, as mentioned previously, a major limitation is our 
implicit inference of the parameter k and based on the estimated serial interval distribution. 
Despite the limitations, our main goal in this section is to illustrate the potential value in 
utilizing contact tracing observables to estimate IZf. through formula ([T^. We emphasize that 
our general approach for calculating TZe with can avoid the need to utilize a serial interval 
distribution. Instead, contact tracing observables can be directly used to calculate k and km in 


the formula (12) if the detailed tracing records are available. 


Our methodology can quantify the impact of contact tracing and case hospitalization indepen¬ 
dent of tracing (passive case hnding) in reducing the reproduction number utilizing formulas 


(10) and (13). In particular, under the assumption of perfect tracing {6 = 0), for given level 
of reporting of untraced cases (fraction p), the effective reproduction number without contact 
tracing, 7^°, can be calculated with the weekly estimates of Tie, the data for fraction of reported 
cases that are traced contacts (g), and formula (10). In Figures 6(c) and 6(d), 71° is plotted 
above Tie for two different levels of reporting untraced cases, 75% and 100%, for both Guinea 
and Sierra Leone. We also consider impact of contact tracing in the case of imperfect trac¬ 
ing (and 100 % reporting), namely for tracing specihc reproduction number reduction of 50% 
{9 = 0.5) in Figures [6(a)| and 6(b) From the estimates of Tie and data for q, we also determine 


the reproduction number, TZq, without contact tracing or any case hospitalization given values 


of p and Tfi/Tu in the case of perfect tracing. Figures 6(e) and 6(f), compare weekly values of 
Tie with 71° and TIq for p = 0.75 and p = 1 (assuming T/j = 4 and = 9.8). Thus we can 
utilize formulas (1^, (12), (10) and (13) with contact tracing observables in order to determine 


the reduction in reproduction number due to contact tracing and passive case hnding, which 
allows for evaluating the ehectiveness of these diherent interventions. 

These hgures illustrate that contact tracing in both countries had an impact on Tie from 
January to the middle of March, but the impact has been much more dramatic in Sierra 
Leone. Indeed, even when assuming 50% efficacy of contact tracing/monitoring {9 = 0.5), our 
calculations show that contact tracing has pushed the average ehective reproduction number 
over this time period below one in Sierra Leone. Our results indicate that contact tracing ehorts 
may need to be increased in Guinea in order to ensure epidemic control and to reduce the risk 
from unknown chains of transmission. The target goal of WHO is “for 100% of new cases to 
arise among registered contacts, so that each and every chain of transmission can be tracked 
and terminated”. While this is certainly an ideal target, it is important to at least reduce and 
stabilize Tie substantially below one in order to control the epidemic with limited resources and 
end the outbreak in a timely manner. 

3.3. impact of contact tracing and control on disease dynamics in general model 

In order to investigate the coupled effects of contact tracing, case reporting and disease char¬ 
acteristics on epidemic spread, we analyze the general model ([^. In particular, variable tracing 
responsiveness and monitoring protocols, along with public health campaigns independent of 
contact tracing, are evaluated according to their impact on the effective reproduction number 
He and cumulative case count. In addition, computation of He in this general setting helps to 


quantify potential underestimation of He in formulas derived in Section 3T for the simplifying 
cases of “perfect” tracing speed {pe = 1), and either perfect monitoring efficacy {(dm = 0) or 
perfect reporting {p = 1). 
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Figure 6: Impact of contact tracing: Weekly estimated values of TZ° (red) (reproduction number without tracing, 
calculated using (10) and © ) are plotted above the estimated values of TZg for both Guinea (a,c,e) and Sierra 
Leone (b,d,f) assuming traced contact reproduction number reduction 9 = 0.5 (a,b) (purple) and p = 1; and 
perfect tracing, i.e. 0 = 0 (c-f) (blue), for two different levels of reporting of untraced cases, 75% (green) and 
100% (red). In (e) and (f), the basic reproduction number, TZq, without any case hospitalization is also displayed 
assuming Th = i and T„ = 9.8. 
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Figure 7: (a) Contour map of TZg when varying time until hospitalization/reporting (T/j) and proportion of 

reported case contacts that are traced {(j>) in the special case of perfect monitoring = 0). Note that for 
the other parameters, we set: f3 = OA/N where TV = 6 x 10® (population of Sierra Leone), infectious period for 
unreported infected T„ = 9.8, proportion of untraced infected hospitalized p = 0.75, incubation period r = 10 
and tracking delay 5 = 2. Note Pe, pi, pi, Ti are given by formulas derived earlier, (b) The effect of varying (3^. 
(monitoring efficacy) and (j). Note that Tm = 2 and all other parameters are as in (a). 


The effective reproduction number can be reduced by both contact tracing and public 
campaigns aimed at increasing case hospitalization and decreasing time until hospitalization 
among untraced individuals (passive case finding). Considering both types of control strategies 
together, we find that passive case finding has a synergistic effect on the efficacy of contact 
tracing and reduction of TZe- In particular, Figure 7(a) shows that reduction in time until 
hospitalization (independent of tracing), Tp, has a nonlinear impact on the proportion of re¬ 
ported case contacts that are traced, 0, required to bring Tie below one. Indeed, reducing Tp 
has the effect of both decreasing the reproduction number and number of contacts of untraced 
individuals and increasing the efficacy of tracing through improvements in tracing speed {pe is 
increased, T* is decreased). In the Appendix D, Figure 1(a) illustrates a similar synergistic 


effect observed for increasing case hospitalization/reporting, p. 

Consideration of the details of the contact tracing process leads to questions about how distinct 
tracing and monitoring parameters affect the disease dynamics. We explore how the extent of 
contact tracing (through proportion of reported case contacts that are traced, 0), strictness of 
monitoring protocol (transmission rate of traced contacts, /3m) and delay between index case 


reporting to tracing (<5), affect Tie (Figure 7(b), 1(b)) in Appendix D). We generally find 


that contact tracing more individuals has a larger impact then the strictness of monitoring 
and tracing speed. Indeed, increasing the fraction of contacts traced, 0, is a more “upstream” 
intervention and the relatively long mean incubation period r (~ 10 days) of Ebola allows 
for more time to track contacts. This long incubation period, combined with the fact that 
infectiousness and symptoms onset are coincidental and transmission occurs only through direct 
contact, are disease traits which make Ebola particularly amenable to contact tracing. Our 
observations are corroborated in a global sensitivity analysis of Tie with respect to the model 
parameters (Figure D.2 in Appendix D). Overall, this sensitivity analysis suggests that the 
proportion of (untraced) cases hospitalized/reported, p, has the largest impact on Tie, followed 
by time until hospitalization (independent of tracing) Tp, transmission rate (3 and proportion 
of reported case contacts traced 0. 

In addition to the effect on Tie, we investigate the impact of parameters and various control 
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Figure 8: (a) Simulations of cumulative reported and total cases before and after contact tracing/enhanced 

control measures. The before represents the initial exponential increase of the epidemic in Sierra Leone with data 
fit by model using parameters (3 = QA/N (as above), p = 0.5, yielding = 2. After the vertical dashed line (48 
days from t = 0) shows the trajectory of reported (solid) and total (dashed) cumulative cases for four different 
scenarios: no control {TZe = 2); Th = 5, p — 0.75 {TZg = 1.0068);Tft = 5, p = 0.95 {Tie = 0.7531);T;i = 3, p = 0.75 
{Tie = 0.8679). Also j3m = 0.5,0, p = 0.75, 5 = 2, Tm = 2 and r = 10. Note that actual cumulative reported 
cases for Sierra Leone is « 9,500 at end time of simulations ( Dec. 31). (b) Cumulative weekly reported 
and traced cases for the scenarios in (a), (c) Weekly fraction of reported cases which are traced contacts (d) 
Cumulative total, reported and traced at day 215 for the three scenarios in (a) when tracing (and enhanced 
control) begins at day 24, 48 and 72. 


scenarios on the cumulative case count in the model (@. Assuming an initial reproduction 
number of T^e = 2 (with Tp = 5, Tu = 9.8, p = 0.5) for the 48 days between May 27 and 
July 14 (a reasonable £t for the initial exponential epidemic growth in Sierra Leone mm), 
we simulate forward the cumulative cases under different control scenarios in Figure The 
simulations show that minimizing effective reproduction number TZ^ does not always align with 
minimal cumulative cases over a time period (or over the entire epidemic; example shown in 
SI). In particular, reducing time until hospitalization toTp = 3 with contact tracing proportion 
(j) = 0.75 has a larger impact on cumulative cases over the time period t = 48 to t = 210 
than the case when Tp = 5, (j) = 0.95, even though the respective TZe values have opposite 
ordering. Also, the number of traced reported cases for the hrst scenario {Tp = 3,0 = 0.75) 
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is substantially less than the other scenario {T^ = 5, 0 = 0.95), which may be important given 
limited contact tracing resources. While our model does not explicitly account for tracing 
susceptible individuals, the number of total traced individuals can be thought of as proportional 
to the number of traced infected with proportionality constant 1/p where p is the probability 
of transmission per contact. Simulations show that timing of the control measures are crucial 
in reducing both cumulative cases and the number of contacts which need to be traced (Figure 
8(d)[ ). Global sensitivity analysis of total cumulative cases at t = 148 is similar to the sensitivity 
analysis of Tie, except that 0 has slightly less impact, and the tracing delay 5 and incubation 
period r (both of which affect the probability of reaching an infected contact during incubation 
period, pe) gain more importance. Summarizing the results, we hnd that passive case hnding 
along with contact tracing, and early initiation of the interventions are key components in 
controlling an Ebola epidemic under the constraint of limited resources. 

To conclude this section, we remark upon how the simple formulas derived for Tig under 
the assumption of “perfect” tracing or perfect reporting differ from the more general case. 
Clearly the case of “perfect” tracing {Prn = 0, Pe = 1) results in lower values of Tie than in 
the imperfect setting of (3m > 0, Pe < 1- However, as shown in Figure 7(b), the simplihcation 
of perfect tracing does not lead to substantially lower values of Tie than the imperfect case, 
especially in the general case of “perfect monitoring” {(3m = 0, Pe < !)• A more interesting 
question to explore is the following: how well does the formula in terms of tracing observables. 
Tie = k{l — q)/q + km (where k and km is # of infected traced per reported untraced and 
traced case, respectively, and q is fraction of reported cases that are traced), hold in the general 
setting? In Figure [8(^ it is shown that the weekly fraction of reported cases that are traced, 
q{t), converges to a constant value for each scenario. Thus, a constant value q seems to be 
possible to dehne more generally, but this quantity, along with k and km, may be difficult to 
formulate in a simple expression of model parameters. It is conceivable that such a formula can 
hold in general by a similar heuristic argument for the formula holding with underreporting 
given in Section 3T for the case of perfect tracing. Indeed, while imperfect reporting and 
tracing lead to “missing cases” in individual transmission chains of an index reported case, 
more reported cases are generated throughout the network, potentially balancing out the effect 
of imperfect reporting and tracing. 


4. Discussion 

During the 2014-2015 West Africa Ebola epidemic, it was clear that control measures were 
insufficient as the number of cases grew rapidly, but ultimately improvements in individual 
behavior, community engagement, early case isolation and safe burials have slowed down the 
disease trajectory. To have the best chance of averting another epidemic of Ebola or another 
emergent pathogen, it is important to know which particular interventions were most responsible 
for reversing the momentum. In this study, we focus on analyzing the effectiveness of contact 
tracing along with passive case hnding for the West Africa Ebola outbreak and other potential 
epidemics in the future. 

We have developed a mechanistic differential equation model of the contact tracing process 
in an infectious disease outbreak, particularly relevant to Ebola outbreaks. It is widely rec¬ 
ognized that contact tracing is an integral component of Ebola interventions |1]. Our deter¬ 
ministic model explicitly links contact tracing to reported case transmissions, and incorporates 
the coupled effects of disease traits, passive case hnding (independent of tracing) and trac¬ 
ing/monitoring ehorts. The novel framework of the model is separating infectious cases that 
will be reported, further separating a fraction of their transmissions which will be traced and 
determining the action of tracers based on infectious stage of the traced contact. This captures 
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the dynamic structure of contact tracing. 

The explicit linking to transmission in our model brings out an interesting connection between 
observable contact tracing data and the effective reproduction Tlei yielding novel formulas for 
real-time estimates of TZe- We show that under the assumption of perfect tracking, along with 
either perfect monitoring of traced contacts or perfect reporting of all cases, IZe = k{l—q)/q+km 
where k and k^ are the average number of secondary infected individuals traced per primary 
reported untraced and trace case, respectively, and (1 — g)/g is the odds that a reported cases 
is not a traced contact. This formula ([^ can provide simple current estimates of IZf. in the 
population, as illustrated utilizing a limited dataset of weekly reported and traced Ebola cases 
in Guinea and Sierra Leone from mid January to April 2015. Our results indicate that contact 
tracing has likely been important in calming the epidemic for Sierra Leone over this time period, 
but had less impact on the disease dynamics in Guinea. Increasing contact tracing effort and 
efficacy has the potential to eliminate the disease at this later stage in the outbreak. The 
relative contribution of contact tracing in influencing the value of TZe is increasing with g, the 
fraction of reported cases that are traced contacts, and T^e —t as g —)■ 1. Therefore, the goal 
that WHO has set forth to bring the fraction of traced reported g to one in order to “eliminate 
all unknown transmission chains” is indicative of utilizing contact tracing to end the epidemic. 

There is less applicable data on contact tracing in 2014 when the Ebola outbreak was at a 
more critical stage in West Africa, and there are more questions about the efficacy of tracing 
during times of intense transmission as contact tracers may have been overwhelmed with high 
volumes of contacts to trace [3] . While further investigation into the efficacy of contact tracing 
in West Africa throughout the epidemic utilizing all data sources is needed, this is beyond 
the scope of our current work. However, we can utilize our sensitivity analysis and model 
simulations to gain insight on the impact and optimal deployment of contact tracing during 
different stages in the outbreak. First, our study suggests that a combination of interventions 
can work synergistically, which may help to alleviate logistical problems with contact tracing 
in high transmission areas. In particular, improvements in passive case Ending (faster and 
more frequent isolation independent of contact tracing) together with contact tracing are most 
efficient in reducing incidence. In this case, even with limited resources, contact tracing can 
play an important role in controlling hot zones. 

Gontact tracing is a multi-faceted strategy with multiple ways of impacting disease dynamics 
and distinct operational variables to consider. For instance, tracking the contacts and notify¬ 
ing/educating them about their status can reduce their secondary transmission if they develop 
symptoms, even if they are not isolated right away. Our model includes the details of the 
contact tracing process along with quantifying their relative contribution to the overall efficacy 
of the intervention. We find that the fraction of (high-risk) contacts traced per reported case 
is more important than delays in tracking (since the incubation period of Ebola is relatively 
long) and potentially the strictness of monitoring protocol (assuming their transmission and 
time to isolation is reduced enough). Optimizing the allocation of resources to tracing more 
individuals can possibly be achieved, for example, by checking up on traced contacts less often, 
but establishing trust and educating on the first visit to reduce the risk of that contact trans¬ 
mitting the disease. Another finding from simulations of our model is the dramatic difference 
that earlier intervention can make when applied before the number of cases is high. Gontact 
tracing effectively at the beginning of an epidemic substantially reduces cumulative cases and 
overall need of resources. Gertainly, it also vastly increases the probability of ending an out¬ 
break rapidly, as in the case of Nigeria and Senegal in 2014. Finally, contact tracing data can 
be utilized to evaluate the overall epidemic status and efficacy of interventions in real time in 
order to adjust efforts to the current situation. 

There are several limitations to our model, along with promising directions for future work. 
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As mentioned in the previous paragraphs, limited resources in the face of an inflating epidemic 
can severely restrict the impact of contact tracing. Although our results shed light on how to 
alleviate this problem through passive case hnding or adjusting monitoring protocols, it may 
be informative to explicitly incorporate tracing logistics in the model. Another future line 
of research will be to investigate whether formulas relating reproduction number to contact 
tracing observables, such as IZe = k{l — q)/q + k^, can be derived for contact networks, along 
with more generally extending our model to networks. In addition, further work will involve 
evaluating the accuracy of these formulas through simulated data from stochastic versions of 
the model. Also, other disease distributions, e.g. gamma-distributed incubation and infectious 
period, can be utilized to better match real data and generalize our approach. Our work 
establishes a theoretical framework to build upon for the mechanistic modeling of contact 
tracing and utilizing contact tracing observables to obtain simple real-time estimates of effective 
reproduction number, along with evaluating the impact of control strategies on an epidemic. 
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Appendix A. Calculation of effective reproduction number 

The effective reproduction number TZe can be dehned utilizing the next-generation approach 
[3^ . First dehne the feasible region for the system (|^ as 


T={^ = {S,E, El El 4,4,4^ e 


2=1 


i < Nq 


where denotes the non-negative orthant of and Nq is the initial total population size (of 
all compartments). We note that the system ([^ is quasi-positive, and thus its solutions remain 
non-negative when their initial values are nonnegative. Summing the right-hand sides of (|^, 
we hnd that iV'(f) < —7 (4(^) + 4(^) + 4(^) + -^c(^)) < 0 for a positive constant 7 . Thus the 
solutions the system (|^ remain in T when their initial values are in T. Notice that for initial 
susceptible population, Sq, Sq := {Sq, 0, 0,0, 0, 0, 0,0)^ is disease-free equilibrium of system (|^. 

We dehne a next-generation matrix by considering the linearized system at the disease-free 
equilibrium, Sq. Write the linearized “infection” sub-system as y' = {E — V)y, where E 
contains entries corresponding to new infections, and —V contains all other transition terms in 
the Jacobian matrix evaluated at So- Thus, we consider the following matrices: 


E 


So 


/O 

0 

0 

0 

0 

0 

VO 


0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 


(l-0)/3 f3 

Pe(t)l3 0 

{l-Pe)(j)/3 0 

0 0 

0 0 

0 0 

0 0 


(1 - (p)l3m 
Pl<Pl3m 
(1 - Pl)(j)l3m 
0 
0 
0 
0 


(l-0)/3 \ 

(1 

0 

0 

0 

0 
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V 


/ ^ 000 0 0 0 \ 

0 i 0 0 0 0 0 

0 0 ^ 0 0 0 0 
-pi 0 0 1/T;, 0 0 0 

-(l-p)i 0 0 0 1/T„ 0 0 

0 -i 0 0 0 1/T^ 0 

\ 0 0^ -i 0 0 0 1/TiJ 


The next-generation matrix describing expected number of new infections (by the different types 
of infectious cases) is then dehned as K := FV~^ [32] • The effective reproduction number, IZe, 
is the spectral radius of K, g{K): 


7 ^, = g{K) = g{FV-^) 


Appendix A.l. Calculation ofTi ^ 

For individuals in the J* compartment, the average infectious period (prior to being tracked), 
Ti, can be approximated by considering the expected value of the time after symptom onset 
when a traced contact is tracked (given that they have progressed past their incubation period 
upon tracking), T). The formula for T) in the calculation of Tj (jh) is derived as follows: 


F = Th +6- 
=Th+6- 



E ( length of incubation period X \ infectious at tracking time) 


1 

— E ( X I infected at time r G [0, Tf) and X G [r, + S] ) dr 

Th Jo 


rTh+& 


Th + 5- 


xe 


— {x—r)/T 


e-Cr ds 


dx 


dr 


Here the notation E (X|^) denotes the expected value of random variable X given information 
A. Note that the compartment /* includes both individuals that are tracked when infectious and 
those that are tracked after infectiousness (removed by recovery, death or reporting independent 
of tracing). We also remark that clearly the waiting times corresponding to infectious period 
independent of tracing (with mean approximated by T) and contact tracking of infectious (with 
mean approximated by T)) are independent. Thus it is reasonable to treat the corresponding 
processes as independent exponentially distributed, and therefore to consider the sum of the rates 
in formula (|^. Also, note that T) can more accurately be calculated by also integrating over 
the index case infectious period probability distribution as with the calculation of pe, however 
to simplify, we assume a fixed infectious period Th for the index reported case. 


Appendix B. Formulae for IZe in special cases 


For the special cases of the model considered in Section 3.1[ consider 
traced individuals are always tracked during their incubation phase, i.e. 


the scenario in which 
Pe = 1. Then, we can 
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reduce the general system ([^ to the following system: 

= -(3S{h + Iu)-PmSI, 


E' = 


I'u = 


I'u 

(Ec)' 

iQ' 


PSIu + (1 - 0 ) fiSh + (1 - 0 ) (imSh - -E 
r n 


P 


E - ^L, 


ct>l3Sh + 00™^/c - -Ec 

T 

-Ec - T^Ic 
T Tm 


(A2) 


Here Ec{t) and Ic{t) are the exposed individuals who are contact traced (necessarily in their 
incubation phase since Pe = 1) and infectious contact traced (monitored) individuals, respec¬ 
tively. 

First, consider the scenario of “perfect tracing”, i.e. the case of perfect monitoring (0m = 0). 
In this case, since /3m = 0, the traced individuals do not transmit the infection, thus Ec and Ic 


compartments are decoupled from the first four equations in system (A2). Either by inserting 
/3m = 0 and Pe = 1 into the E matrix in the next-generation decomposition for the general 
model or calculating Tie in the reduced model (A2) with /3m = 0, we find: 


Tie — Tin 


pp^(l-0) + l-p 


(A3) 


where TZq = (3SqTu. 

Consider Ic{s)ds and Ih(s)ds, the cumulative number of reported contact traced 

cases and reported cases independent from contact tracing, respectively, from time t — a to t. 
Define the fraction of reported cases that are traced contact (from time t — a to t): 


Qait) 


^Iijcis)ds 


Tm E rp J^_^Ih{s)ds 


In addition, define the following constant q: 


Q = 


0 +(l- 0 )P+(l-p)^ 


As mentioned in Section 3.1, g can be interpreted as the conditional probability that a transmis¬ 
sion is traced given that this case is reported. Indeed, the probability that a case (transmission) 
is reported can be expressed as the sum of the probabilities of the three mutually exclusive 
events that the transmission was caused by a traced reported case, untraced reported case, and 
unreported case. These probabilities are the summands in the denominator (multiplied by p) 
of q. 

Proposition 1. For sufficiently small a and sufficiently large time t, 


qait) ^ q. 
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Proof 1. Note that 


^ E^{s)ds - Icit) + Icit - a) 

^ Ec{s)ds - Ic{t) + Ic{t - a) + 7 E{s)ds - 4(t) + 4(t - a) 

ft-a 4>/dS{s)Ih{s)ds + 'ipc{t) 

iLa 4>PS{s)Ih{s)ds + 'ipc{t) + P (/?S'(s)4(s) + (1 - (j))pS{s)Ih{s))ds + ijit) 
0 Jt_a^^(^)^h{s)ds + iJoit) 

{(j) + p(l - 0)) (3S{s)Ihis)ds + p (5S{s)Iu{s)ds + ^{t) + ^c{t) ’ 


wheretlj{t) = Ih{t—a)—Ihit)+E{t—a)—E{t—a) and'il’cit) = Icit—a)—Ic{t)+Ec{t—a)—Ec{t—a). 
Moreover 

f (pl'u - (1 - p)Ih)ds = f (^7^4 - ^Iu)ds. 

Jt—a Jt—a 

Then we obtain the following equality: 


Tup{Iu{t) - Iu{t - a)) + (1 - p){Ih{t -a) - 4(t)) 



plu{s)ds. 


Let define 9{t) = T„p(4(t) - 4(t - a)) + (1 - p)(4(^ - a) - hit)). 

For small enough a, we can assume Sit) is constant over the interval [t — a, t]- Then 

his)ds + fjcit) 

Qyt) = - 7 -—- 7 - 

/35(0 + p(l -(/>))/^_^4(s)rfs + /3^(l -p)^/^_^4(s)c?s + i/>(t) +'0c(^) -Oii) 

As in other outbreak models, it can be shown that the infection components converge to zero as 
t —)• oo. Thus 'ipit),'ipcit), 9it) ~ 0, when t is sufficiently large. Then for sufficiently large t, we 
obtain 


Thus inserting q and k = 
reproduction number 71^. 


dait) 


0+(l-0)p+(l-p)4 


= T 


(dSoThcj) into (A3), we find the following formula for the effective 


'}Ze = /3 ipThil - 0) + (1 - p)Tu) 

= Fpii - ,1,) +1 - p) 

0 



Next, consider the case of imperfect tracing (monitoring), f3m > 0, but perfect reporting, 
p = 1. As in the previous case, IZe reduces to a simple formula: 

4e = (1 - 0)4 + (fTZm, 

where 4 := fiS^T^ and 4^ = (dmSoTm. In this case, it is not hard to see that q = (f and we can 
similarly prove that fait) ~ q for large enough t and small enough a. Then, with k^ = 4^.0, 
we have 

4e = k ^^ + km,. 
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Appendix C. Base SEIR models 

Base model with single infectious compartment: 


S'{t) = -^S{t)I{t) 

E'{t) = (5S{t)I{t) - -E{t) 

T 

I'(t) = - h(t) (Bl) 

r 1 

H'(t) = ^I{t) 


Proposition 2. The two base models ^ and (Bl) are equivalent if = T and /^(O) = 

pl{0),lu{0) = (1 — p)/(0). In addition, the reproduction number of (Bl) is TZq = fIT, thus the 


reproduction number of models ^ and (Bl) are equal in general when T = pTh + (1 — p)Tu- 


Proof 2. For the solution to (j^, define I{t) = Ih{t) + Iu{t). Then I'{t) = \E{t) — ^I{t) and 
/(O) = //i(0) + /n(0) = /(O). Noting that the compartment H is decoupled in both systems, 
we can conclude that {S(t), E(t), I(t)) = {S(t), Eft), I(t)), where I(t) is the infectious solution 


to (Bl). Notice that {pi)' = -E — h^pl) and (p/)(0) = lh{0). Thus Ih{t) = pi ft) = pift)- 


Therefore, we also find that H'ft) = ^Ihit) = §^I{t) = ^I{t). This shows models and \B1^ 
are equivalent if T^ = Tu = T and //i(0) = pJ(0),/„(0) = (1 — p)/(0). The second statement is 
immediate. 


Note that numerical simulations show that the hnal size of the epidemics match for solutions 
to models Q and (Bl) when the reproduction numbers are equal, i.e. T = pTh + (1 — p)Tu 
and (3 is the same. In addition, numerics show that if T^ ~ T^i, then the transient behavior of 
solutions of the two models {Ift) and Ihft) + luft) are almost identical, however as |T/j — T„| 
increases, the transient dynamics become more distinct even though the hnal sizes will match. 


Appendix D. Additional Sensitivity Analysis Figures 



reported case,i^ and tracing i 


(a) (b) 

Figure D.l: Surface of effective reproduction number, TZe, for general model § when varying (a) p and f, (b) 
S and T. The parameters are as in Figure [7(a^ with Th = 3 and f = p = 0.75 in (b). 
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(a) (b) 

Figure D.2: (a) Global sensitivity analysis of TZe with respect to parameters using Latin Hypercube Sampling 
(LHS) and Partial-Rank Correlation Coefficients (PRCC). (a) Sensitivity analysis of cumulative total cases with 
respect to parameters using LHS and PRCC. 
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